Repeated Inverse Reinforcement Learning
We introduce a novel repeated Inverse Reinforcement Learning problem: the agent has to act on behalf of a human in a sequence of tasks and wishes to minimize the number of tasks in which it surprises the human by acting suboptimally with respect to how the human would have acted. Each time the human is surprised, the agent is provided with a demonstration of the desired behavior by the human. We formalize this problem, including how the sequence of tasks is chosen, in a few different ways and provide some foundational results.
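The interaction protocol the abstract describes can be pictured as a simple loop: the agent acts on each task, and a human demonstration after every surprise shrinks the set of reward hypotheses still consistent with the human. The following is a minimal illustrative sketch, not the paper's actual algorithm; the finite candidate set, the linear rewards over action features, and all function names are assumptions made for the toy example.

```python
def optimal_action(task, reward):
    # task: list of (action_name, feature_vector) pairs;
    # pick the action maximizing the linear reward w . phi(a)
    return max(task, key=lambda a: sum(w * f for w, f in zip(reward, a[1])))[0]

def repeated_irl(tasks, candidates, true_reward):
    """Toy sketch of the repeated-IRL protocol: keep every reward
    hypothesis consistent with all demonstrations seen so far, and
    count the tasks on which the agent surprises the human."""
    hypotheses = list(candidates)
    surprises = 0
    for task in tasks:
        agent_choice = optimal_action(task, hypotheses[0])  # act on any consistent hypothesis
        human_choice = optimal_action(task, true_reward)
        if agent_choice != human_choice:  # human is surprised
            surprises += 1
            # the demonstration rules out every hypothesis that disagrees with it
            hypotheses = [r for r in hypotheses
                          if optimal_action(task, r) == human_choice]
    return surprises
```

In this toy version each surprise strictly shrinks the hypothesis set, so the number of surprises is bounded by the number of candidate rewards; the paper's contribution is to obtain analogous bounds in continuous reward spaces and under different task-selection protocols.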
Reviews: Repeated Inverse Reinforcement Learning
The authors present a learning framework for inverse reinforcement learning wherein an agent provides policies for a variety of related tasks and a human determines whether the produced policies are acceptable. They present algorithms for learning a human's latent reward function over the tasks, and they give upper and lower bounds on the performance of the algorithms. They also address the setting where an agent is "corrected" as it executes trajectories. This is a comprehensive theoretical treatment of a new conceptualization of IRL that I think is valuable. I have broad clarification/scoping questions and a few minor points.
Kareem Amin, Nan Jiang, Satinder Singh
Published at the Neural Information Processing Systems Conference.